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Abstract 

Background: Drug prescription practices depend on several factors related to the patient, health worker and health 
facilities. A better understanding of the factors influencing prescription patterns is essential to develop strategies to 
mitigate the negative consequences associated with poor practices in both the public and private sectors. 

Methods: A cross-sectional study was conducted in rural Tanzania among patients attending health facilities, and 
health workers. Patients, health workers and health facilities-related factors with the potential to influence drug 
prescription patterns were used to build a model of key predictors. Standard data mining methodology of 
classification tree analysis was used to define the importance of the different factors on prescription patterns. 

Results: This analysis included 1,470 patients and 71 health workers practicing in 30 health facilities. Patients were 
mostly treated in dispensaries. Twenty two variables were used to construct two classification tree models: one for 
polypharmacy (prescription of >3 drugs) on a single clinic visit and one for co-prescription of 
artemether-lumefantrine (AL) with antibiotics. The most important predictor of polypharmacy was the diagnosis 
of several illnesses. Polypharmacy was also associated with little or no supervision of the health workers, 
administration of AL and private facilities. Co-prescription of AL with antibiotics was more frequent in children 
under five years of age and the other important predictors were transmission season, mode of diagnosis and the 
location of the health facility. 

Conclusion: Standard data mining methodology is an easy-to-implement analytical approach that can be useful for 
decision-making. Polypharmacy is mainly due to the diagnosis of multiple illnesses. 
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Background 

Irrational drug use or misuse means the distribution or 
consumption of drugs in ways that negate or reduce 
their efficacy or in situations where they are unlikely to 
have the desired effect [1]. Besides being an economic 
burden for the patients, communities and health sys- 
tems, it may result in drug resistance, ineffective treat- 
ment and adverse drug events. According to the World 
Health Organization (WHO) the 'rational use of drugs 
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requires that patients receive medications appropriate to 
their clinical needs, in doses that meet their own individ- 
ual requirements for an adequate period of time, and at 
the lowest cost to them and their community' [2]. Ir- 
rational drug use is a widespread practice in developing 
counties [3], though it occurs in high-income countries, 
e g, antibiotics [4,5]. 

Tanzania revised its malaria treatment policy in 2006 
and replaced sulphadoxine-pyrimethamine (SP) with 
artemether-lumefantrine (AL) as first-line treatment for 
uncomplicated malaria [6], complying with the WHO 
guidelines [7]. Such policy change was necessary because 
of the decreasing efficacy of SP [6]. The success of a new 
treatment policy partly depends on the compliance of 



© 2012 Kajungu et al.; licensee BioMed Central ttd. This is an Open Access article distributed under the terms of the Creative 
Commons Attribution ticense (http://creativecommons.Org/licenses/by/2.0), which permits unrestricted use, distribution, and 
reproduction in any medium, provided the original work is properly cited. 



Kajungu et al. Malaria Journal 2012, 11:311 
http://www.malariajournal.eom/content/1 1/1/31 1 



Page 2 of 1 1 



prescribers' to the national guidelines and on the 
patients' adherence to the treatment [8]. Irrational treat- 
ment practices are more common in the private than in 
the public sector and in most developing countries the 
private sector manages over half of all malaria cases [9]. 
Prompt and effective treatment of malaria patients is 
one of the cornerstones of the global efforts to reduce 
malaria morbidity and mortality. As many countries in 
sub-Saharan Africa are introducing relatively expensive 
artemisinin-based combination therapy (ACT) into the 
formal health system for uncomplicated malaria, policy 
makers, programme managers and donors are interested 
in assessing the quality of malaria case management with 
ACT, in particular ensuring that malaria patients are ap- 
propriately treated with an ACT [10]. 

The INDEPTH effectiveness and safety studies of anti- 
malarial drugs in Africa (INESS) project is a Phase IV 
study on both existing and new combination anti-malarial 
therapies in at least seven INDEPTH demographic sur- 
veillance system (DSS) sites in four African countries. This 
project effectively creates the missing final section of the 
drug development pipeline for Africa by ensuring local 
evidence on treatment effectiveness. The purpose is to 
minimize the time gap between licensure and adoption of 
new anti-malarials by providing objective endemic- 
country effectiveness data that will help inform global and 
national policy and practice. This project also enhances 
the capacity of Africa to monitor local health systems 
costs, effective coverage, and effects of new or alternative 
post-registration anti-malarial treatments. 

The health facility survey conducted as part of the 
evaluation of system effectiveness of ACT, generated data 
on drug prescriptions from patients and health workers in 
rural health facilities, either private or public. This analysis 
besides describing the drug prescription patterns illus- 
trates the use of standard data mining methods of classifi- 
cation tree analyses for investigating the influence specific 
patient, health care provider or health facility characteris- 
tics may have on drug prescription. 

Methods 

Study area and population 

The study was conducted in March and October 2010 
on patients and health workers in health facilities located 
within the Rufiji and Ifakara health and demographic 
surveillance system (HDSS) areas. Rufiji district in the 
coast region has about 182,000 inhabitants and a HDSS 
in place since 1998, covering a population of about 
85,000 inhabitants [11]. The Ifakara HDSS is located in 
southern Tanzania and covers two districts (Kilombero 
and Ulanga) in Morogoro region, with a population of 
about 99,000 people [12]. All government and non- 
government health facilities providing outpatient care 
within the HDSS areas were included (16 in Rufiji and 



14 in Ifakara). Investigators visited each facility for two 
to three days and collected information on attending 
patients. The target sample size was 720 patients per 
HDSS to estimate the proportion of those with uncom- 
plicated malaria correctly treated with an ACT with 10% 
precision, assuming these represented 20% of all 
patients, that 75% of malaria patients would be treated 
with an ACT, and a design effect of 2. All patients 
attending for illness the health facilities on the days of 
the survey were eligible. Health workers were following 
the national guidelines for diagnosis and treatment of 
malaria. 

Data collection 

After providing their consent to be included in the sur- 
vey, patients were interviewed prior to leaving the health 
facility using standardized questionnaires developed in 
English and translated into Kiswahili. Information on 
history of fever, health worker's diagnoses, laboratory 
tests, medications prescribed and counselling messages 
was collected. Pregnant patients were excluded from the 
survey. Health workers providing outpatient consulta- 
tions were also interviewed on pre-service training, work 
experience, in-service training and supervision visits 
received. Assessment of the level of staffing, availability 
of diagnostics, medications and other medical supplies 
was done at the health facility. A list of variables used in 
the analysis is shown in Table 1. 

Data management and analysis 

Data were double entered and validated using EpiData 
3.1 [13], cleaned, managed and analysed using STATA 
11[14] and SPSS [15]. Measures for central tendency 
and dispersion used to describe these data were mean 
and 95% confidence intervals for continuous variables 
and percentages for categorical variables. The popula- 
tion was first described on demographics and other 
variables of importance. 

This paper describes and analyses the INESS project 
health facility survey data from Tanzania. The different 
methods of data analysis applied in this paper are 
explained in the next section, starting from standard 
data mining techniques to investigate the predictors of 
polypharmacy and co-prescription of AL with an anti- 
biotic at health facilities in rural Tanzania. Polypharmacy 
is defined as a practice where a patient was prescribed 
three or more medications at a single encounter in a 
health facility or with a health worker. 

Statistical methodology 

Logistic regression is the standard technique used to 
investigate the relationship between binary response 
variables and a set of explanatory variables. However, 
with the large number of explanatory variables involved, 



Kajungu et al. Malaria Journal 2012, 11:311 
http://www.malariajournal.eom/content/1 1/1/31 1 



Page 3 of 1 1 



Table 1 Classification tree variables 



Table 1 Classification tree variables (Continued) 



Type 



Variable 



Levels 



Transmission season 



Health Type of health facility 

facility-related 



Ownership of health facility 



Any artemether lumefantrine 
available in health facility today 



Supplies to administer oral 
medications available 



Availability of any scale in the 
health facility 



No stock-outs of artemether 
lumefantrine in previous 90 days 



Health Mode of diagnosis 

worker-related 

Health workers' cadre 



Patient- 
related 



Health worker sex 



Health worker age 

Patient seen by health 
worker with medical training 



Possession of national malaria 
guideline 



Possession of national malaria wall 
chart 



Total number of diagnoses 
presented on a single visit 

Age group of the patients 



Patient treated 

with artemether lumefantrine 



Patient sex 



Hospital 

Health Centre 

Dispensary 

Public 

Private 

No 

Yes 
No 

Yes 
No 

Yes 
No 

Yes 

Laboratory confirm 

Presumptive diagnosis 

Medical Officer/ 
Doctor 

Medical Assistant/ 
Clinical Officer 

Nurse and lower 
level cadres 

Male 

Female 

Continuous variable 
No 

Yes 
No 

Yes 
No 

Yes 

Continuous variable 

Less than 5 yrs 
5-14 yrs 

1 5 yrs and above 
No 

Yes 

Male 

Female 



High season 
Low season 



Patient seen by health worker who No 
trained on use of new anti-malarial 

Yes 

Patient seen by experienced health No 
worker 

Yes 

Patient seen by health worker No 
supervised within previous six months 

Yes 

Patient seen by health worker with No 
IMCI training 

Yes 



because of potential co-linearity and interactions, it 
would be complex to investigate for each covariate the 
nature of the relationship (linear, interaction terms, 
quadratic, etc.). Moreover, with 22 variables there are 
462 possible two-way interactions and it is not feasible 
to consider all of them. For these reasons, non- 
parametric classification tree modelling methodologies 
were used [16-18]. A classification and regression tree 
(CART) analysis is a useful nonparametric data-mining 
technique. This analysis is particularly helpful when 
attempting to investigate which direct and indirect 
measures of risk are predictive of a newly emerging or 
complex disease [17]. Contrary to classical regression 
that uses linear combinations, this method does not re- 
quire the data to be linear or additive. Furthermore, 
classification tree analysis does not require to pre- 
define possible interactions between factors [18]. 
Therefore, the resulting classification trees accommodate 
in an intuitive manner more flexible relationships among 
variables, missing covariate values, multi-colinearity, and 
outliers [19]. When values for some predictive factors are 
missing, they can be estimated using other predictor 
("surrogate") variables, permitting the use of incomplete 
data sets when generating regression trees [16,18]. An- 
other advantage of classification tree analysis (compared 
with a classical multivariate regression analysis) is that it 
allows for the calculation of the overall discriminatory 
power, or relative importance, of each explanatory 
variable. 



Classification tree modelling 

To gain more insight into factors related to dependant 
variables of polypharmacy (prescription of >3 drugs) and 
co-prescription of AL with antibiotics in rural health facil- 
ities, the binary classification tree modelling methodology 
as introduced by Breiman [18] was used. Classification 
trees are widely used in applied fields as diverse as 
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medicine (diagnosis), computer science (data structures), 
botany (classification), and psychology (decision theory) 
[18-20]. Classification trees are popular in applied fields 
partly because they are agreeable to graphical display and 
easy to interpret compared to the use of strict numerical 
interpretation. Flexibility and hierarchical nature are two 
important features characterizing classification trees. 

Classification tree analysis is a nonlinear and nonpara- 
metric model that is fitted by binary recursive partition- 
ing of multidimensional covariate space. The analysis 
successively splits the data set into increasingly homo- 
geneous subsets until it is stratified to meet specified 
criteria [16,18,19,21,22]. The Gini index was used as the 
splitting method, and 10-fold cross-validation was used 
to test the predictive capacity of the obtained trees. The 
classification tree method performs cross validation by 
growing maximal trees on subsets of data then calculat- 
ing error rates based on unused portions of the data 
set. To accomplish this, it divides the data set into 10 
randomly selected and roughly equal "parts," with each 
part containing a similar distribution of data from the 
populations of interest (e.g., polypharmacy vs single pre- 
scription). The method then uses the first nine parts of 
the data, constructs the largest possible tree, and uses 
the remaining 1/10 of the data to obtain initial esti- 
mates of the error rate of the selected sub-tree. The 
process is repeated using different combinations of the 
remaining nine subsets of data and a different 1/10 data 
subset to test the resulting tree. This process is repeated 
until each 1/10 subset of the data has been used as to 
test a tree that was grown using a 9/10 data subset. The 
results of the 10 minitests are then combined to calcu- 
late error rates for trees of each possible size; these 
error rates are applied to prune the tree grown using 
the entire data set. The consequence of this complex 
process is a set of fairly reliable estimates of the inde- 
pendent predictive accuracy of the tree, even when 
some of the data for independent variables are incom- 
plete, specific events are either rare or overwhelmingly 
frequent, or both. 

For each node in a generated tree, the "primary split- 
ter" is the variable that best splits the node, maximizing 
the purity of the resulting nodes. When the primary 
splitting variable is missing for an individual observation 
that observation is not discarded but, instead, a surro- 
gate splitting variable is sought. A surrogate splitter is a 
variable whose pattern within the data set, relative to the 
outcome variable, is similar to the primary splitter. Thus, 
the program uses the best available information in the 
face of missing values. In data sets of reasonable quality, 
this allows all observations to be used. This is a signifi- 
cant advantage over more traditional multivariate regres- 
sion modeling, in which observations missing any of the 
predictor variables are often discarded [17-19]. 



Variable relative importance ranking 

One of the goals of classification tree analysis is to de- 
velop a simple tree structure for predicting data, result- 
ing in relatively few variables that appear explicitly as 
splitters, a result that may suggest that the other vari- 
ables are not important in understanding or predicting 
the dependent variable. However, unlike a linear regres- 
sion model, a variable in a classification tree modelling 
can be considered highly important even if it never 
appears as a node splitter [19,21,23,24]. Because the 
method keeps track of 'surrogate' splits in the tree- 
growing process, the contribution a variable can make in 
prediction is not determined only by primary splits. 

To calculate a variable importance score, the classifica- 
tion tree analysis method looks at the improvement 
measure attributable to each variable in its role as either 
a primary or a surrogate splitter. The values of ALL 
these improvements are summed over each node and to- 
talled, and are then scaled relative to the best performing 
variable. The variable with the highest sum of improve- 
ments is scored 100, and all other variables will have de- 
creasing lower scores. The importance score measures a 
variable's ability to perform in a specific tree of a specific 
size either as a primary splitter or as a 'surrogate' splitter. 
The relative importance ranking of variables tends to 
change dramatically when comparing trees of substan- 
tially different sizes. Therefore, the importance scores 
(rankings) are strictly relative to a given tree structure 
and should not be interpreted as the absolute informa- 
tion value of a variable. 

The TREE command in SPSS [15] was used to gener- 
ate the classification trees showing the classification 
rules generated through recursive partitioning and rela- 
tive variable importance. 

Ethical clearance 

The study received ethical clearance from the Ifakara 
Health Institute ethical review board (IHI/IRB/No.A67- 
2009) and national ethical clearance after having met the 
criteria for ethical considerations. 

Results 

Description of patient characteristics by health facility 
ownership 

A total of 1,470 patients attended the outpatients depart- 
ment (OPD) of health facilities, most of them (1,116, 76%) 
at 18 publicly owned facilities and the rest at 12 private fa- 
cilities (Table 2). More than half (53.5%) of the patients 
were less than five years old and 53% were females. The 
majority (71%) of patients attended OPD of a peripheral 
health facility while just 2.1% that of a hospital. Laboratory 
diagnosis was done in more than half of the patients 
(54%), while the rest were presumptively treated by a doc- 
tor and/or other clinical personnel. Laboratory tests were 
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Table 2 Patient characteristics by health facility ownership 



Public n = 1,116 (76%) 



Private n = 354 (24%) 



Total n = 1,470 



N 



% 



N 



% 



N 



% 



Gender 

Male 507 45.4 

Female 609 54.6 
Age groups 

Less than 5 597 53.5 

5 to 14 189 16.9 

15 + 330 29.6 
Health facility type 

Dispensary 713 63.9 

Health centre 403 36.1 

Hospital 0 0 
Diagnosis mode 

Laboratory confirmation 499 44.7 

Presumptive 386 34.6 

Missing 231 20.7 

Total number of drugs taken on this visit; mean (9596CI) 2.2(2.1-2.3) 
Number of drugs taken grouped 

One 178 16.0 

Two 558 50.0 

Three 311 27.9 

More than three 59 5.3 

Missing 10 0.9 



1/5 
179 

104 

56 

194 

323 

0 

31 

295 

36 

23 

2.5(2.4-2.6) 

42 

143 

121 

44 

4 



49.4 
50.6 

29.4 
15.8 
54.8 



83.3 
10.2 
6.5 



40.4 
34.2 
12.4 
1.1 



682 
788 

701 
245 
524 

1036 

403 

31 

794 
422 
254 

2.3(2.2-2.3) 

220 
701 
432 
103 
14 



46.4 
53.6 

47.7 
16.7 
35.6 

70.5 
27.4 
2.1 

54.0 
28.7 
17.3 



15.0 
47.7 
29.4 
7.0 



more commonly used in private (83%) than in public sec- 
tor (45%) health facilities. Overall, the average number of 
drugs prescribed per encounter was 2.3 (range: 1-6) per 
patient, with private sector patients receiving more drugs 
than those in public (2.5 vs. 2.2). Only 15% of the patients 
were prescribed only one drug while 7% received more 
than three drugs on one visit (Table 2). 

Out of 1,470 patients interviewed, 14 did not have record 
of any medication taken. Of the remaining 1,456 patients 
who were prescribed at least one drug, 36.7% were pre- 
scribed three or more medicines (polypharmacy). Using the 
variables in Table 1 to fit the model, the most important 
predictor of polypharmacy was the total number of 
diagnosed illnesses at a single clinic visit - a patient-related 
factor (Table 3). This was followed by a facility related fac- 
tor - ownership (private/public) with discriminatory power 
of 36.1%. Other factors were treating a patient with first- 
line drugs (AL) with a power of 27%, health worker age 
(power: 26%), a health worker being trained in IMCI with 
anti-malarial components, a facility not experiencing AL 
stock-out in the previous 90 days (health facility-related), 
health worker gender, health worker having been super- 
vised in the previous six months (power: 11.5%), the 
remaining factors had<10% discriminatory power (Table 3). 



Table 3 Ranking of the predictors of polypharmacy by 



their overall discriminatory power 

Independent variable Power 

Total number of diagnoses per patient on a single visit 100.0 

Ownership of health facility (public/private) 36.1 

Patient treated with artemether lumefantrine 26.8 

Health worker age 26.0 

Patient seen by health worker with IMCI training 16.1 

No stockouts of artemether lumefantrine in previous 90 days 13.9 

Health worker sex 12.1 
Patient seen by health worker supervised within previous six months 1 1 .5 

Transmission season 4.8 

Availability of any scale in the health facility 4.5 
Patient seen by health worker with experience on caring patients 3.9 

Supplies to administer oral medications 2.4 

Type of health facility 1 .0 

Health workers' cadre 0.4 

Any artemether lumefantrine available in health facility today 0.1 
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The difference in discriminatory power between the top 
predictor variable and the next most important predictor 
was substantial (100% vs 36.1%). Similar results are 
obtained by the classification tree model (Figure 1). 

Patients diagnosed with more than one disease on a 
single attendance had higher chances of being prescribed 
three or more medicines: 69.6% (compared to 32.1% in 
those with one diagnosis). The next most important fac- 
tor among patients with more than one diagnosis was 
supervision of health worker within the previous six 
months where polypharmacy occurred in 77.2% patients 
treated by an unsupervised health worker as compared 



to 52% in those treated by a supervised health worker 
(p-value<0.0001). For patients with one diagnosis, the 
next splitting factor was being treated with AL whereby 
more than three drugs were prescribed to 43% of 
patients treated with AL compared to 27.3% of those not 
treated with AL. Looking further at patients with one 
diagnosis and treated with AL, those treated in a facility 
that did not experience AL stock-out in the previous 
three months were more likely to be given more than 
three drugs (48%) compared to 25% of those served in 
clinics that experienced AL stock outs in the previous 
three months but this difference was not significant 



Category 


% n 


No 


63.3 921 


Yes 


36.7 535 


Total 


100.0 1456 



Total number of diagnoses per patient on 
a single clinic visit 



Less than two 



Two and above 



Category 


% n 


No 


67.9 866 


Yes 


32.1 409 


Total 


87 6 1275 


H 



Patient treated with artemether 
lumefantrine (AL) 



Category 


% 


n 


No 


30.4 


55 


Yes 


69.6 


126 


Total 


12.4 


181 





Seen by health worker supervised in 
previous six months 



NO 



Yes 



r 

NO 



Yes 



Category 


% 


n 


NO 


72.7 


646 


Yes 


27.3 


243 


Total 


61.1 


889 




Id 



Category 


% 


n 


NO 


57.0 


220 


Yes 


43.0 


166 


Total 


26.5 


386 


H 



Category 


% 


n 


No 


22.8 


29 


Yes 


77.2 


98 


Total 


8.7 


127 



Category 


% 


n 


NO 


48.1 


26 


Yes 


51.9 


28 


Total 


3.7 


54 



Ownership of health facility 



No stockouts of artemether 
lumefantrine in previous 90 days 



Private 



Public 



NO 



Yes 



Category 


% 


n 


No 


53.0 


97 


Yes 


470 


86 


Total 


126 


183 



Category 


% 


n 


No 


77.8 


549 


Yes 


22 2 


157 


Total 


48.5 


706 



Category 


% 


n 


No 


75.3 


61 


Yes 


24.7 


20 


Total 


56 


81 



Category 


% 


n 


No 


52.1 


159 


Yes 


479 


146 


Total 


20.9 


305 



Figure 1 Classification tree representing the predictors of polypharmacy prevalence. Note: Inside the boxes (nodes); Yes ; 
equal to three drugs (Polypharmacy), No = Less than three drugs. 



Greater or 
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(p-value = 0.124). In patients with one diagnosis not 
treated with AL, ownership of facility was the next split- 
ting factor where those served in privately owned facil- 
ities were more likely to receive three or more drugs 
compared to those in public facilities (47% vs 22.2%) and 
this was statistically significant (p-value = 0.003). Over- 
all, the classification tree for polypharmacy (Figure 1) 
had a sensitivity of 67% and a specificity of 66%. Health 
worker's age and training in Integrated Management of 
Childhood Illnesses (IMCI) may not have appeared in 
the classification tree as main splitters but are important 
variables as shown by their overall discriminatory power 
of 26% and 16.1% respectively (Table 3). 

Co-prescription of antibiotics with artemether-lumefantrine 

Overall, 84.7% (n = 1,233) of the patients interviewed had 
more than one treatment with a median number of two 
prescribed medications per patient-clinic visit (range 1-6). 
Among the 508 (34.9%) patients treated with AL, the most 
commonly prescribed concomitant medications were 
analgesics (87.4%, n = 445), followed by antibiotics (41.6%, 
n = 212) and other medications (31.6%, n = 161). Accord- 
ing to the overall discriminatory power from the classifica- 
tion tree analysis, patient age emerged as the strongest 
overall risk factor for co-prescription of AL with an anti- 
biotic, closely followed by the season of the interview 
(power: 93%), mode of diagnosis (power: 90%), availability 
of national malaria guideline (power: 84.5%) (Table 4). 

Classification tree modelling for co-prescription of 
artemether-lumefantrine with antibiotics 

The classification tree partitioned the predictors according 
to the overall discriminatory power of variables (Figure 2). 
In modelling the co-prescription of AL with antibiotics, 
patient clinical status (number of diagnoses) was found 
to be very important and when included in the model it 
masked the effect of other variables. For that reason it 
was deliberately removed so as to examine and assess 
the importance of other factors which may not be as 
obvious. The classification tree for co-prescription of AL 
with antibiotics has the most important predictors as 
patient age, transmission season, mode of diagnosis and 
location of health facility in terms of HDSS. Co- 
prescription of AL with antibiotics was done for 41.7% 
of the patients visiting the clinics during the study 
period. This was common among patients aged less than 
five years, at 47.9% compared to 35.2% of those aged 
five years and above. For the older patients (>5 year), 
this co-prescription was common in those treated after 
being tested for malaria (42.1%) compared to those pre- 
sumptively treated (28.6%), though the difference was 
not statistically significant (P-value = 0.1471). In the 
under fives, co-prescription occurred more frequently 
during the high (55.6%) than in the low malaria 



Table 4 Ranking the predictors of co-prescription of 



artemether-lumefantrine with antibiotics 

Independent variable Power 

Age group of the patients 100.0 

Transmission season 92.6 

Mode of diagnosis 90.4 

Possession of national malaria guideline 84.5 

Possession of job aids 78.7 

Patient seen by health worker with IMCI training 77.6 

Patient seen by health worker supervised within previous six months 67.0 

Patient seen by health worker with experience 40.2 
Patient seen health worker who trained on use of new anti-malarial 36.8 

Health worker sex 32.3 

Possession of national malaria wall chart 26.9 

Health worker age 18.9 

Health workers' cadre 10.6 

Patient seen by health worker with medical training 9.1 

Ownership of health facility (public/private) 8.6 



transmission season (38.5%). The location of the facility 
was important whereby patients treated in the high mal- 
aria transmission season at a facility in the Ifakara HDSS 
catchment area were more likely to be co-prescribed 
with AL and an antibiotic (62.5%) than those served in 
Rufiji HDSS catchment area (46.9%) (P-value = 0.0086). 
Some variables did not appear as main splitters in the 
tree despite their overall high discriminatory power 
(Table 4). This is due to the fact that they are important 
at several stages of the classification building tree but 
never as important as the main splitter. 

Discussion 

The first step to improve the rational use of drugs is to 
understand prescribing patterns. This paper demonstrates 
the application of classification tree analysis models a 
non-parametric modelling methodology to explore factors 
influencing drug prescription practices in health facilities 
of rural Tanzania. Classification trees are user friendly and 
easy to interpret and have been utilized to identify the 
main risk factors for malaria infection in Burundi and 
Vietnam [21,23]. In this analysis, the classification tree 
method revealed logical results of the relationships be- 
tween the outcomes of interest (polypharmacy and co- 
prescription of AL with antibiotics) and the predictor 
variables. 

While multinomial models reveal factors that predict 
the outcome in the whole population, classification 
tree analysis helps in detecting population segments 
that need specific attention in relation to the outcome. 
Segmenting populations supports decision makers in 
targeting their efforts to specific subgroups. It is 
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Figure 2 Classification tree representing the predictors of co-prescription of artemether-lumefantrine (First line drug) with an 
antibiotic. Note: Inside the boxes (nodes); Yes = Co-prescription of AL with antibiotics, No = No co-prescription of AL with antibiotics. 



important to note that this analysis does not support 
any claim of superiority of one methodology compared 
to the other. 

This analysis demonstrated some real-life treatment 
practices at the facilities. It is common for most patients 
to report with more than one complaint, which compels 
the health worker to prescribe more than one medica- 
tion for each identified illness. The IMCI strategy was 
introduced by WHO to reduce child morbidity and 



mortality. Indeed, treatment of childhood illness may 
also be complicated by the need to combine therapy for 
several conditions [25]. It is therefore not surprising that 
the total number of diagnoses was the most important 
predictor of polypharmacy as revealed by both its rank- 
ing in terms of importance and being a major splitter in 
the classification tree. A plausible explanation is that 
health workers are insecure about the diagnosis since in 
most cases the available laboratory services are unable to 
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accurately determine the cause of illness. Therefore, to 
satisfy the patients, the health worker prescribes more 
drugs and then justifies this practice by diagnosing sev- 
eral pathological conditions. 

Supervision of health workers is another important 
predictor of polypharmacy. Indeed, polypharmacy was 
more common among unsupervised health workers. It is 
worth noting that this study did not comprehensively ex- 
plore the type of supervision, limiting the conclusions 
on the possible consequences of not having adequate 
supervision on prescription practices. Nonetheless, it 
would be advisable for district health authorities to in- 
clude drug prescription practices during their routine 
supervision visits. 

Polypharmacy is common in the private sector where 
individual motivation and incentives may have prepon- 
derance over the knowledge and skills of the providers. 
Health worker age, sex, being trained in IMICI and 
being supervised in previous six months were the health 
worker-related variables identified as the other import- 
ant variables that explain polypharmacy. The observa- 
tion that patients treated with AL in a clinic that did not 
experience stock-outs of artemether-lumefantrine are 
more likely to be prescribed several treatments is 
expected and could be due to the IMCI strategy, which 
recommends this practice, especially for children pre- 
senting with multiple symptoms. 

In general, presumptive diagnosis was common in 
public facilities while laboratory results were more used 
in privately owned facilities. In SSA, it is common prac- 
tice not to use the test result when treating fever cases 
[26]. The practice of presumptive treatment for malaria 
has been and is still being practiced in several health fa- 
cilities, both in rural- and urban-based centres, because 
the syndromic treatment for febrile illnesses has been 
standard practice for long time, and clinicians mistrust 
the laboratory results due to poor quality of the labora- 
tory tests. Patients with a negative malaria test are still 
treated with an anti-malarial on the grounds that signs 
and symptoms are compatible with the diagnosis of mal- 
aria. This continues even after the introduction of mal- 
aria rapid diagnostic tests. [26] 

The recommended first-line anti-malarial drug (AL) 
was more commonly used in public than in private facil- 
ities as the former are supplied with essential drugs dir- 
ectly by the central pharmacy. This may change with the 
introduction of the Affordable Medicine Facility for mal- 
aria (AMFm) strategy in Tanzania whose approach is to 
supply subsidized AL to the private sector [27]. It will 
therefore be interesting to look at how these changes in 
the health system will affect the prescription patterns in 
Tanzania and other African countries over time. 

There was a high level of co-prescription of antibiotics 
with AL, particularly in children less than five years 



living around the Ifakara area. A study in Ghana con- 
ducted predominantly in government facilities in an 
urban setting showed that 30.8% of patients were receiv- 
ing at least one antibiotic in addition to the recom- 
mended anti-malarial [28]. Co-prescription with 
antibiotics is a life saving practice and commonly prac- 
ticed in health facilities in sub-Saharan Africa since 
patients can present with multiple illnesses at a single 
clinic visit. This is why it was promoted under the IMCI 
strategy. However, this has implications for the patient's 
safety as it may increase the risk of drug-drug interac- 
tions (D-DI), therapeutic failure, drug resistance and ad- 
verse events [29]. If this practice of co-prescription of 
drugs, which is common in rural health facilities, is not 
addressed, it may cause a major problem as the risk of 
adverse drug events (ADE) increases with an increasing 
number of medicines prescribed [30-33]. 

Classification tree analysis models are useful in 
expressing relationships between variables since they do 
not need to be linear or additive and the possible inter- 
actions do not need to be pre-specified or of a particu- 
lar multiplicative form. Results are presented in the 
form of a decision tree, a different approach than the 
standard statistical analysis. The results highlight areas 
that merit further attention and can act as a guide for 
further epidemiological and hypothesis-driven research. 
The classification trees provide a more flexible relation- 
ship between variables; missing values of the covariates, 
multi-colinearity and outliers are taken care of in an in- 
tuitively and correct manner [19]. This methodology has 
proven its usefulness and adequacy in other areas and 
contexts, for example the bee colony collapse disorder, 
bovine spongiform encephalopathy and analysis of urban 
farming systems in central Africa [17,19,20]. In malaria, 
this method has been used for ranking highland malaria 
risk factors in Burundi and in Vietnam [21,23]. However, 
it has a limitation of not providing p-values and standard 
deviations as in familiar parametric methods. Another 
limitation is that confounding values make classification 
tasks more difficult. Although this decreases true positive 
rates and accuracies, the constructed classification trees 
are valuable. The benefit of the trees is that they simulate 
more the real life situation with patients who have con- 
founding attributes. Future work should be aimed at 
finding different ways to handle confounding values in 
the reasoning process. Another advantage is that the im- 
portance of the variable can still be seen in the variable 
relative importance. 

A variable may be ranked among the top ones for the 
discriminatory power but may not appear as an important 
splitter in the classification tree, e g, training in IMCI with 
anti-malarial component. This happens because it is an 
important surrogate but not a major splitter. The ranking 
by overall discriminatory power is determined by the sum 
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across all nodes in the tree of the improvement score that 
the predictor has when it acts as a primary or a surrogate 
splitter. Consequently, a health worker having IMCI train- 
ing with anti-malarial component enters the tree as the 
top surrogate splitter in many nodes but never as primary 
splitter. 

Initiatives like the INESS Phase IV platform, working 
within communities through the HDSS system should 
continue to evaluate the effect of provider practice on 
new and old products and may be extended to other 
therapeutic areas such as ARVs, anti-TBs, antibiotics 
and vaccines. Inclusion of the private sector, e g, private 
pharmacies, retail shops, mobile drug sellers and even 
traditional herbalists will provide public health managers 
with more evidence on which to base their decisions. 

Conclusions 

The classification tree analysis approach can be used to 
classify prescription patterns using health facility informa- 
tion. This procedure offers an opportunity to examine alter- 
native methods of identifying predictors of prescription 
patterns that might assist decision makers to improve 
targeted service provision factors. This study has demon- 
strated that polypharmacy is mainly associated with mul- 
tiple diagnoses while co-prescription of AL with antibiotics 
is mainly associated with patient age. Although these are 
considered life saving practices, they expose patients to risks 
of adverse drug reactions. Bacterial antibiotic resistance 
should be looked as a public health emergency and the two 
competing causes of increased bacterial resistance are 
irrational antibiotic use and availability of poor quality anti- 
biotics. Drug prescription practices may improve by intro- 
ducing targeted interventions such as regular supervision of 
health care providers in both public and private health 
facilities. 
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